Gaze-Responsive Advertisement

ABSTRACT

This disclosure provides a system by which content/service providers can enable gaze-responsive advertisement/targeted content mechanisms for CPE devices like STB&#39;s/OTT streaming devices or systems or display devices. The system creates a new opportunity for content/service providers to advertise and helps in generating revenue. The system leverages using gaze-based technology in CPE devices like STB&#39;s or large display devices. A gaze-based algorithm helps the CPE devices to locate the viewer point of attention, using which gaze-responsive advertisement/content targeting can be achieved. The disclosure provides an end to end solution—from content creation to content consumption for gaze-responsive advertisement/media delivery.

CROSS REFERENCE TO PROVISIONAL APPLICATION

This application claims priority under 35 U.S.C. § 119(e) from earlierfiled U.S. Provisional Application Ser. No. 62/698,741, filed on Jul.16, 2018 and is incorporated herein by reference in its entirety.

FIELD OF THE INVENTION

The present embodiments relate generally to human-machine interfaces,and specifically to interfaces that combine multiple user interactionmodalities. This disclosure provides new methods by which content orservice provider can use gaze-based technology for advertisement andrelated targeted or associated multimedia content.

The embodiments disclosed herein relate to attentive user interfaces forimproving communication between humans and devices. More particularly,this invention relates to use of eye contact/gaze direction informationby technological devices and appliances to more effectively communicatewith users, in device or subject initiated communications.

Terminology

The following acronyms are used in this disclosure:

STB Set top box AOI Area of Interest SEI Supplemental EnhancementInformation STB Set-Top Box CPE Consumer Premises Equipment MPEG MotionPictures Expert Group AVC Advanced Video Codec NAL Network AbstractionLayer

BACKGROUND

Interaction with technological devices is becoming an ever-increasingpart of everyday life. However, effectiveness and efficiency of suchinteraction is generally lacking. In particular, when seeking userinput, devices such as computers, cellular telephones and personaldigital assistants (PDAs) are often disruptive, because such devicescannot assess the user's current interest or focus of attention. Moreefficient, user-friendly interaction is desirable in interactions withhousehold appliances and electronic equipment, computers, and digitaldevices.

One way that human-device interactions can be improved is by employinguser input such as voice and/or eye contact, movement, or position toallow users to control the device. Many previous attempts relate tocontrolling computer functions by tracking eye gaze direction. Forexample, teachings for systems for controlling a cursor on a computerscreen based on user eye gaze direction or to controlling informationtransfer, downloading, and scrolling on a computer based on thedirection of a user's eye gaze relative to portions of the computerscreen.

Use of eye information for interaction with devices other than computersis less common. Some systems teach activation of a keypad for a securitysystem, also using an eye tracker while other systems employ detectionof direct eye contact. While it is evident that considerable effort hasbeen directed to improving user-initiated communications, little workhas been done to improve device-initiated interactions orcommunications.

Normally advertisements or associated media (banner, PPV alert, ticker)will be broadcasted on time slots for CPE devices without viewerinteraction. There are no viewer gaze-responsive advertisement/mediadelivery mechanism available for CPE devices like set top boxes or overthe top devices (STB's/OTT).

Currently the gaze-responsive advertisement/media delivery mechanismsare available for computer displays. In the available mechanisms, thearea of interest needs to be always fixed. In case of active videosequence, it cannot be expected that the advertised object/items alwaysremain in fixed location. So, these mechanisms cannot be directlyapplied for larger display devices.

Thus, there is a need for methods by which content or service providerscan use gaze-based technology for advertisement and related targeted orassociated multimedia content.

Popular methods for gaze detection algorithms include open source opensource gaze detection algorithm and head pose estimation using OpenCVand Dlib.

This opensource algorithm for gaze detection can provide 95% accuracy ingaze detection. Eye Recognition techniques based on Eigen eyes methodsare also popular.

SUMMARY

An embodiment includes an apparatus comprising an IP camera configuredto continuously capture an image associated with a viewer, a receiver, agaze processor to identify view-gaze coordinates and gaze direction anddetermining the viewer attention area; media content comprising acollection of gaze metadata, a gaze metadata parser to parse thecollection of gaze metadata; and a gaze ad module to derive an area ofinterest (AOI) based on the collection of meta data. The AOI is comparedto the viewer's attention area in order to select a gaze ad URL fordisplay to the viewer. The gaze advertisement module sends data to anadvertisement server. The gaze advertisement module data can be a gazead URL and is sent to an advertisement server. The gaze advertisementmodule gets the ad video content from advertisement server that will beshown to user. In one embodiment, the apparatus is a set top box. Inanother embodiment, the apparatus is a large display device. In yetanother embodiment, the apparatus is a streaming device.

In another embodiment, the gaze metadata comprises (x,y) coordinates foreach advertisable item, quadrant info, an advertisement URL for eachadvertisable item, a time frame for each advertisable item, and a videosource resolution to which the (x,y) coordinates are relative.

Another embodiment includes a system for creating gazed-basedadvertising content, the system comprising, an incoming video stream, animage processing module identification module to receive the incomingvideo stream and identify advertisable items in a frame and provide alist of bitmaps for the advertisable items, an advertisement librarycomprising a list of advertisable items and a corresponding URL to eachadvertisable item, wherein the advertisable items are in a bitmapformat, an ad mapper configured to compare the bit maps from the imageprocessing module to the bitmaps from the advertisement library in orderto provide the URL for the advertisable item; and an image processingmodule to provide the advertisable item coordinates and URL associatedfor advertisement to an advertisement server.

One method to insert advertisement data based on a user's gaze locationcomprises detecting a face associated with a viewer, extracting selectfacial features of the face of the viewer, using a gaze detectionalgorithm to determine (x,y) coordinates of the viewer attention on adisplay device, receiving, from a gaze meta data parser, gaze metadatacomprising (x,y) coordinates representing an advertisable item,receiving a video format, calculating using the (x,y) coordinates andthe video format, an Area of Interest AOI corresponding to theadvertisable item, comparing the AOI with the (x,y) coordinates of theviewer attention, and if the AOI matches the viewer attention, select anadvertisement URL and render the advertisement.

In one embodiment, the method further comprises a pre-determined timethreshold during the which the comparing the AOI with the (x,y)coordinates of the viewer attention is considered valid.

In yet another embodiment, a method for gaze advertisement comprisesdetecting a face associated with a viewer, extracting select facialfeatures of the face of the viewer, using a gaze detection algorithm todetermine (x,y) coordinates of the viewer attention on a display device,receiving a video format, receiving, from a gaze meta data parser, gazemetadata comprising the number of quadrants into which a video frame isdivided, calculating using the meta data, the quadrant corresponding tothe advertisable item, determining if the (x,y) coordinates of theviewer's attention to the display corresponds to the quadrant of theadvertisable items. If the coordinates fall on the quadrant, anadvertisement URL is selected, and then the advertisement is rendered.

In an embodiment, the method further comprises a pre-determined timethreshold during the which the determining if the (x,y) coordinates ofthe viewer's attention to the display corresponds to the quadrant of theadvertisable items is valid.

Yet another embodiment comprises capturing, using an IP camera, an imageof at least one face associated with a respective viewer, detecting, bythe image processor, a gaze direction of the at least one viewer,wherein the detection of the gaze is determined by head pose estimationmethods, receiving a video stream into a gaze metadata parser comprisinggaze metadata, determining if the metadata contains if advertisable itemis present in the left half of the frame or the right half of the frame,determining if the viewer's gaze direction with respect to the displaycorresponds to the determined half of the frame which comprises theadvertisable item; and if the gaze direction corresponds to the half ofthe frame which comprises the advertisable item, then render theadvertisement.

One method further comprises a pre-determined time threshold during thewhich determining if the viewer's gaze direction with respect to thedisplay corresponds to the determined half of the frame which comprisesthe advertisable items is valid.

In certain embodiments, gaze metadata comprises (x,y) coordinates foreach advertisable item, quadrant information, an advertisement URL foreach advertisable item, a time frame for each advertisable item, and avideo source resolution to which the (x,y) coordinates are relative.

BRIEF DESCRIPTION OF THE DRAWINGS

The disclosure is herein described, by way of example only, withreference to the accompanying drawings, wherein:

FIG. 1 is a block diagram 100 that schematically illustrates thefunctional components of the system implementing the user interface inaccordance with an embodiment of the present invention;

FIG. 2 is another block diagram 200 that schematically illustrates thefunctional components of the image process module at content creation,in accordance with an embodiment of the present invention;

FIG. 3 is an illustration of the system described in the FIGS. 1-2 and4-6 in a live environment;

FIG. 4 is a flow diagram that schematically illustrates a method ofdetecting a direction of a gaze of a user, in accordance with anembodiment of the present invention;

FIG. 5 is a flow diagram that schematically illustrates a method ofdetecting a direction of a gaze of a user, in accordance with anembodiment of the present invention; and

FIG. 6 is a flow diagram that schematically illustrates a method ofdetecting a direction of a gaze of a user, in accordance with anembodiment of the present invention.

DETAILED DESCRIPTION

This disclosure provides methods and apparatus by which content/serviceproviders can enable gaze-responsive advertisement/targeted contentmechanisms for CPE devices such as STB's/OTT streaming devices orsystems or display devices. This disclosure creates new opportunity forcontent/service provider to advertise and helps in generating revenue.

Embodiments disclosed herein leverage using gaze-based technology in CPEdevices like STB's or large display devices. The gaze-based algorithmcan assist the CPE devices to locate the viewer point of attention,using which gaze-responsive advertisement/content targeting can beachieved. The disclosed embodiments herein provide end-to-endsolutions—from content creation to content consumption forgaze-responsive advertisement/media delivery.

At the content distribution site, the gaze sensitive data isinserted—for example (x,y) co-ordinates/frame quadrant in the media.This data helps the display device to identify the targeted media.

At the content consumption site, the CPE device uses available gazedetection algorithm(s), maps the viewers gaze to the correspondingtargeted media and subsequently selects/presents the targeted media.

Eye-gaze detection and tracking have been an active research field inthe past years as it adds convenience to a variety of applications. Itis a significant untraditional method of human computer interaction.Head movement detection has also received attention and interest as ithas been found to be a simple and effective interaction method. Theseserve a wide range of functions. For both eye tracking and head movementdetection, several different approaches are used to implement differentalgorithms for these technologies.

FIG. 1 is a block diagram 100 that schematically illustrates thefunctional components of the system implementing the user interface inaccordance with an embodiment of the present invention. Multiplexer(MUX) 101 receives audio input 102 from an external source. MUX 101 alsoreceives video input 103 from an external source. The Video Multiplexertool embeds geospatial information from a metadata file to a video toproduce a geo-enabled video file, which can be visualized and used fordata capture in the Full Motion Video (FMV) player. With the propermetadata, users can display the video footprint, sensor location, andlook angle on a map as they plays the video. After processing, MUX 101outputs video stream 105 to an Image Processing Module 201. Afterprocessing, the Image Processing Module 201 sends signal 112 back to theMUX 101 which then outputs a stream 135 to transmitter 120.

Signal 135 is then transmitted to display device 140. Display device 140can be a set top box. Set top box 140 comprises a receiver 143, IPcamera 141 to detect the gaze 152 of viewer 150, gaze processor 145,gaze meta data parser 147 and gaze ad module 149. The output 151 fromdisplay box 140 is sent to advertisement server 130. The gazeadvertisement module 149 sends data 151 to an advertisement server 130.The gaze advertisement module data 151 can be a gaze ad URL and is sentto an advertisement server 130. The gaze advertisement module 149 getsthe advertisement video content from advertisement server that will beshown to user 150. In one embodiment, the apparatus is a set top box. Inanother embodiment, the apparatus is a large display device. In yetanother embodiment, the apparatus is a streaming device.

In all flowcharts, a reasonable time threshold is used to match viewer'sgaze data with the AOI provided by gaze metadata, to avoid falsepositive matches, when deciding to select/play the advertisement.

FIG. 2 is a block diagram 200 that schematically illustrates thefunctional components of the system implementing the user interface inaccordance with an embodiment of the present invention. Block diagram200 illustrates the Process Module at content creation. Image processingmodule 201 comprises an identification module 210 which is configured todetermine the identification of advertising items into a frame. Module210 accepts video input 205 and provides a list of bit maps 216corresponding to each advertising item.

An advertising library 230 comprises a bit map list of items 231 andtheir corresponding URL 233. The advertising library 230 and the module210 provide input 327 and 216 respectively to Ad Mapper 220. The admapper 220 compares the bit map in input 216 to the bit map in input 237and provides the URL for the advertising item.

Ad Mapper 220 output 240 comprises gaze meta data 242 which comprisesthe URL stream 222 as well as ad item coordinates 212. The advertisinglibrary can be managed by the service provider.

Block diagram FIG. 2 illustrates an embodiment for gaze metadatainsertion; however, how advertisable objects are identified is out ofscope of invention. How advertisable objects are identified can be doneby content creators or service providers, either manually or using sometools. In FIG. 2, the URL provided by Ad mapper for specific ad item, isa generic URL that will be understood by Ad server. At the time of adcontent download in the display device (TV/STB), the Ad server willprovide the content relevant at that point of time. This avoids changingthe stream that is transmitted from the content creation/transmissionend.

In gaze metadata insertion methods described herein, the gaze metadatacan have following details:

-   -   X,Y Co-ordinates for each Ad object (Or Quadrant info)    -   AD URL for each Ad object (Video URL, Ad Poster URL)    -   Time frame for each Ad object    -   Video Source resolution (for which X, Y co-ordinates are        relative to)

In all scenarios, a reasonable time threshold is used to match viewer'sgaze data with the AOI provided by gaze metadata, to avoid falsepositive matches, when deciding to select/play the advertisement.

FIG. 3 is an illustration of a system 300 as also shown in FIGS. 1 and 2in an example home setting 380. In setting 300, users 320, 321, and 322are interacting with a device 325, for example, watching a televisionshowing content or programming 340 supplied, for example, via a set topbox or other consumer premises equipment (CPE).

As shown in FIGS. 1, 2 and 4-6, a camera located in the CPE is used tocapture gaze data associated with each viewer and used to determine thegaze profile of each user. A remote ad server 360 comprises user profilemapping 310 and AOI to ad mapping 330 capabilities. The user profilemappings of the ad server 360 are shown in table 310. These mappingsshow the profiles for each user based on each user's gaze mapping. Inthis example, the area of interest (AOI) in the programming current tothe users, is the wristwatch 342 on the wrist of the man displayed onthe screen of television 325. Module 330 stores the various areas ofinterest (AOIs) 331, 332, 333 with URLs 335, 336, 337 to correspondingadvertising.

In this example, the wrist watch 342 shown in the programming 340matches the entry in the table 330 to AOI 2 332 which corresponds to theURL 336 to advertising of the targeted product (in this case the wristwatch 342). The advertising server 360 receives the information aboutthe desired advertisement related to AOI 332 and its URL 336 and rendersthe advertisement content to be inserted into the programming content340 shown on the viewer device 325.

The method shown in FIG. 4 illustrates a method 400 for gaze advertisingusing the (x,y) coordinates of the advertising item. Beginning at step410, the IP camera captures the viewer's image. At step 430 gazedetection is accomplished. At step 430 the (x,y) coordinates of a userattention on the display device is output. At step 450, a gaze meta dataparser parses the data. The data here refers to media content (videostream) that contains the gaze meta data inserted at contentdistribution end. The gaze metadata parser receives gaze meta data fromthe media content (different embodiments to how gaze metadata can bepart of the media content are later disclosed).

At step 470 an area of interest (AOI) is calculated. The area ofinterest corresponds to the advertising items. At decision 490, the areaof interest is compared to the (x,y) coordinate to see if it matches theviewer attention. If yes, at step 495, the method then moves to selectthe advertising URL and render the advertisement.

FIG. 5 illustrates a method 500 for gaze advertisement using a quadrantmethod. Beginning at step 501, the IP camera captures the viewer image.At step 510 gaze detection is accomplished via a method referred toherein. After gaze detection is accomplished, at step 510 the (x,y)coordinates of the user attention on the display device are output.Module 510 outputs the (x,y) coordinates of user attention on thedisplay device. At step 530, a gaze meta data parser parses the datarelated to the user's gaze. In this method, metadata should contain thenumber of quadrants into which the frame is divided.

At decision step 550, the (x,y) coordinate of the viewer attention iscompared to see if it falls on a quadrant, and which one. If yes, atstep 580, the advertising URL is selected and the advertisement isrendered for insertion into the viewer's programming.

In all flowcharts depicting methods herein, a reasonable time thresholdis used to match viewer's gaze data with the AOI provided by gazemetadata, to avoid false positive matches, when deciding to select/playthe Ad.

The process in FIG. 6 depicts a method 600 for gaze advertising usinghead pose estimation. The increased popularity of the wide range ofapplications of which head movement detection is a part, have increasedthe size of research aiming to provide robust and effective techniquesof real-time head movement detection and tracking. The field ofreal-time head movement detection has received much attention fromresearchers. There are many different approaches for head poseestimation. In a machine, such as a computer, vision, the pose of anobject refers to its relative orientation and position with respect to acamera. The pose can change by either moving the object with respect tothe camera, or the camera with respect to the object.

The pose estimation disclosed can be one of many methods and is outsideof the scope of the disclosed embodiments. The goal is to find the poseof a viewer when there is a calibrated camera, as shown in the figures,and, for example, the locations of n 3D points on the object and thecorresponding 2D projections in the image are known or can beascertained.

Beginning at step 601, the IP camera captures the viewer image. At step610 gaze detection is accomplished using image processing that uses thehead pose estimation method. At step 610 the gaze direction 615 isoutput. At step 630, a gaze meta data parser parses the data related tothe user's gaze direction. In this method, metadata contains informationthat is used to determine whether the advertisement is present in theleft or right half of the frame.

At decision step 650, the gaze direction is compared to the meta datafor a match. If there is a match, the method advances to step 680, andthe corresponding advertising URL is selected, and the advertisementcontent is rendered.

Gaze Processing Algorithms

As disclosed and further expounded upon above in the figures, gazeresponsive advertisement for larger display needs are disclosed. In suchembodiments, content providers and/or service providers insertgaze-related information as meta-data, such as the location of theobject/item that needs to be advertised in the video frame, using whichthe display device can identify the advertised item or object. Thedisplay device/STB, as described above, will parse the gaze meta-data.Using the gaze-meta data, the display device can calculate the area ofinterest (AOI).

The IP camera-enabled display device/STB can contain gaze processorwhich can help the display device to identify the viewers point ofattention. If the viewers point of attention matches with AOI derivedfrom gaze meta data, the display device/STB can tune to theadvertisement present in advertisement server.

Insertion of Gaze Data

In the disclosed embodiments, the content provider or service providercan add the gaze data in any preferred method. The gaze data comprisesdifferent types of meta data. The Gaze meta data comprises one or moreof the following types of data:

-   -   a. (x,y) Co-ordinates, height and width of the object that needs        to advertised. Using this information, the display device can        derive the AOI.    -   b. The source resolution of the video frame. Using this        information, the display device can scale the AOI before trying        to map viewers point of attention.    -   c. The advertisement URL of the advertisement present in the        advertisement server.

This URL can be used by a display device/STB to select an advertisement.

The gaze detecting processing can be simplified by several methods. Oneof the methods to make the gaze detection simpler for the display deviceis to utilize quadrant data. In this method, instead of using (x,y)co-ordinates along with height and width, the service/content providercan indicate the quadrant where the advertised object is present in thevideo frame. One embodiment using this method could be that the videoframe is divided into four (4) quadrants and the content/serviceprovider can indicate in which quadrant (1, 2, 3, or 4) the advertisedobject is present.

To identify the AOI using the gaze-data mentioned above, a displaydevice needs a complex gaze detection algorithm to identify the accurateco-ordinates of a viewer's eye attention on a display screen. Detectingthe orientation of a viewer's face can be much easier compared todetecting the exact location of a viewer's' attention on display screen.Detecting the orientation of a viewer's face can utilize a less complexalgorithm compared to detecting the accurate gaze co-ordinates of aviewer's face.

To ease the complexity of gaze detection algorithm on display device,the disclosed embodiments use different set of gaze-data. This set ofgaze-data comprises:

-   -   a. JPEG image URL—the image that is targeted to be displayed on        display device.    -   b. Direction: Indicates the display device where it needs to        display the JPEG image. The Direction can be LEFT or RIGHT. If        Direction is LEFT, the display device should present the JPEG        poster on the left side of the screen. When this direction        matches with the orientation of viewer's face detected by        gaze-detection algorithm, then display device should select the        advertisement.    -   c. Advertisement URL. —Location of the advertisement.    -   d. Duration: Indicates the amount of time the display device        should display the JPEG image.    -   e. AdTriggerDuration: This data is optional and indicates        minimum of amount of time the viewer should gaze at the poster        consistently after which advertisement can be presented.

The gaze meta data can be part of SEI NAL of H264/AVC or can be part ofseparate data PID in MPEG2 Transport system or can be part of meta dataWebVTT.

An example WebVTT for gaze meta data comprises:

WEBVTT VTTKind: metadata VTTDataFormat: JSON VTTDataHint: Eye-Gaze DataWith the payload of each cue being the JSON itself. The sample payloadof JSON object can be 00:00.000 --> 00:01.000 {   ″X-CoOrdinate″:720,  ″Y-Coordinate”: 1280,   “Height”: 200   “Width”:250   ″SourceResolution”: 1080p   ″Advertisement Url″: ″http://10.232.72.84/Eye-Gaze/Rolex.mp4″ }

Gaze Processing in the CPE Devices

The CPE device can contain a wide-angle IP camera which can beconfigured to capture the viewers face and feed these frames to the gazeprocessor.

The gaze processor can comprise the following features:

The CPE devices can have image processing capability which can processthe frames captured by the IP camera and detect the viewers' attention.

Open source image processing libraries like OpenCV, Dlib are availablewhich can be used to implement the gaze detection

Gaze detection algorithm can be a complex algorithm which could derivethe exact location (x,y) co-ordinates at which viewers eyes are gazing.

To simplify the gaze detection, in some embodiments, a much easier gazealgorithm can be used to just provide information related to viewer headpose and orientation.

Based on the image processor complexity and CPE device configuration,multi-viewers scenario can be considered by which targeted advertisementcan be achieved.

The CPE device can parse the gaze-meta data present in the content andcalculate the area of interest.

If viewers' attention obtained by the gaze detection algorithm matcheswith the area of interest calculated using the gaze-meta data, thenadvertisement will be displayed.

Gaze user profile: If there are multiple viewers, the gaze processingalgorithm can use face detection and determine a profile that has to beused in advertisement selection. There could be multiple profilesdefined, depending on the combination of viewers. Also, there can be apriority mechanism in choosing the viewer, when multiple viewers arepresent, when doing the gaze processing.

It will be appreciated that the embodiments described above are cited byway of example, and that the present invention is not limited to whathas been particularly shown and described hereinabove. Rather, the scopeof the present invention includes both combinations and sub-combinationsof the various features described hereinabove, as well as variations andmodifications thereof which would occur to persons skilled in the artupon reading the foregoing description and which are not disclosed inthe prior art.

Aspects of the subject matter described herein may be described in thegeneral context of computer-executable instructions, such as programmodules, being executed by a computer. Generally, program modulesinclude routines, programs, objects, components, data structures, and soforth, which perform particular tasks or implement particular abstractdata types. Aspects of the subject matter described herein may also bepracticed in distributed computing environments where tasks areperformed by remote processing devices that are linked through acommunications network. In a distributed computing environment, programmodules may be located in both local and remote computer storage mediaincluding memory storage devices.

Also, it is noted that some embodiments have been described as a processwhich is depicted as a flow diagram or block diagram. Although each maydescribe the operations as a sequential process, many of the operationscan be performed in parallel or concurrently. In addition, the order ofthe operations may be rearranged. A process may have additional stepsnot included in the figure.

Although the subject matter has been described in language specific tostructural features and/or methodological acts, it is to be understoodthat the subject matter defined in the appended claims is notnecessarily limited to the specific features or acts described above.

We claim:
 1. An apparatus comprising: an IP camera configured tocontinuously capture an image associated with a viewer; a receiver; agaze processor to identify view-gaze coordinates and gaze direction anddetermining a viewer attention area; media content comprising acollection of gaze metadata; a gaze metadata parser to parse thecollection of gaze metadata; and a gaze advertisement module to derivean area of interest (AOI) based on the collection of meta data, whereinthe AOI is compared to the viewer's attention area in order to select agaze advertisement URL for display to the viewer.
 2. The apparatus ofclaim 1, wherein the gaze advertisement module further sends data to anadvertisement server.
 3. The apparatus of claim 2, wherein the datacomprises the gaze advertisement URL that corresponds to the viewer'sattention area.
 4. The apparatus of claim 1, wherein the apparatus is aset top box.
 5. The apparatus of claim 1, wherein the apparatus is alarge display device.
 6. The apparatus of claim 1, wherein the apparatusis a streaming device.
 7. The apparatus of claim 1, wherein the gazemetadata comprises (x,y) coordinates for each advertisable item,quadrant info, an advertisement URL for each advertisable item, a timeframe for each advertisable item, and a video source resolution to whichthe (x,y) coordinates are relative.
 8. A system for creating gazed-basedadvertising content, the system comprising: an image processing modulecomprising: an incoming video stream; an identification module toreceive the incoming video stream and identify at least one advertisableitem in a frame and provide a list of bitmaps for the at least oneadvertisable item; an advertisement library comprising a list ofadvertisable items and a corresponding URL to each advertisable item,wherein the advertisable items are in a bitmap format; and an ad mapperconfigured to compare the bitmaps from the identification module to thebitmaps from the advertisement library in order to provide a URL relatedto the at least one advertisable item, wherein the image processingmodule outputs the advertisable item coordinates from the identificationmodule and the URL associated with an advertisement associated with theat least one advertisable item.
 9. A method to insert advertisement databased on a user's gaze location, the method comprising: detecting a faceassociated with a viewer; extracting select facial features of the faceof the viewer; using a gaze detection algorithm to determine (x,y)coordinates of the viewer attention on a display device; receiving, froma gaze meta data parser, gaze metadata comprising (x,y) coordinatesrepresenting an advertisable item; receiving a video format; calculatingan Area of Interest AOI corresponding to the advertisable item using the(x,y) coordinates and the video format; comparing the AOI with the (x,y)coordinates of the viewer attention; determining if the AOI matches theviewer attention; and if the AOI matches the viewer attention, selectingan advertisement URL and rendering the advertisement.
 10. The method ofclaim 9, wherein the method further comprises a pre-determined timethreshold during the which the comparing the AOI with the (x,y)coordinates of the viewer attention is considered valid.
 11. The methodof claim 9, wherein the gaze metadata comprises (x,y) coordinates foreach advertisable item, quadrant information, an advertisement URL foreach advertisable item, a time frame for each advertisable item, and avideo source resolution to which the (x,y) coordinates are relative. 12.A method for gaze advertisement, the method comprising: detecting a faceassociated with a viewer; extracting select facial features of the faceof the viewer; using a gaze detection algorithm to determine (x,y)coordinates of the viewer attention on a display device, receiving avideo format; receiving, from a gaze meta data parser, gaze metadatacomprising the number of quadrants into which a video frame is divided;calculating using the metadata, the quadrant corresponding to thelocation of an advertisable item; determining which quadrant correspondsto the (x,y) coordinates of the viewer's attention to the display;selecting an advertisement URL; and rendering the advertisement.
 13. Themethod of claim 12, wherein the (x,y) coordinates are representative ofan advertisable item.
 14. The method of claim 12, wherein the methodfurther comprises a pre-determined time threshold during the which thedetermining if the (x,y) coordinates of the viewer's attention to thedisplay corresponds to the quadrant of the advertisable items is valid.15. The method of claim 12, wherein the gaze metadata comprises (x,y)coordinates for each advertisable item, quadrant information, anadvertisement URL for each advertisable item, a time frame for eachadvertisable item, and a video source resolution to which the (x,y)coordinates are relative.
 16. A method for gaze advertisement insertion,the method comprising: capturing, using an IP camera, an image of a faceassociated with at least one viewer; detecting, by an image processor, agaze direction of the at least one viewer, wherein the detection of thegaze is determined by head pose estimation methods; receiving a videostream into a gaze metadata parser, wherein the gaze metadata parsercomprises gaze metadata; determining within the metadata, if theadvertisable item is present in the left half of the frame or the righthalf of the frame; determining if the viewer's gaze direction withrespect to the display corresponds to the determined half of the framewhich comprises the advertisable item; and rendering an advertisement ifthe gaze direction corresponds to the half of the frame which comprisesthe advertisable item.
 17. The method of claim 16, wherein the methodfurther comprises a pre-determined time threshold during the whichdetermining if the viewer's gaze direction with respect to the displaycorresponds to the determined half of the frame which comprises theadvertisable items is valid.
 18. The method of claim 16, wherein thegaze metadata comprises (x,y) coordinates for each advertisable item,quadrant information, an advertisement URL for each advertisable item, atime frame for each advertisable item, and a video source resolution towhich the (x,y) coordinates are relative.